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Abstract 

The Gcn5-related N-acetyltransferases (GNATs) are ubiquitously expressed in nature and perform a diverse range of cellular 
functions through the acetylation of snnall molecules and protein substrates. Using activated acetyl coenzyme A as a 
common acetyl donor, GNATs catalyse the transfer of an acetyl group to acceptor molecules including aminoglycoside 
antibiotics, glucosamine-6-phosphate, histones, serotonin and spermidine. There is often only very limited sequence 
conservation between members of the GNAT superfamily, in part, reflecting their capacity to bind a diverse array of 
substrates. In contrast, the secondary and tertiary structures are highly conserved, but then at the quaternary level there is 
further diversity, with GNATs shown to exist in monomeric, dimeric, or tetrameric states. Here we describe the X-ray 
crystallographic structure of a GNAT enzyme from Staphyloccocus aureus with only low sequence identity to previously 
solved GNAT proteins. It contains many of the classical GNAT motifs, but lacks other hallmarks of the GNAT fold including 
the classic p-bulge splayed at the p-sheet interface. The protein is likely to be a dimer in solution based on analysis of the 
asymmetric unit within the crystal structure, homology with related GNAT family members, and size exclusion 
chromatography. The study provides the first high resolution structure of this enzyme, providing a strong platform for 
substrate and cofactor modelling, and structural/functional comparisons within this diverse enzyme superfamily. 
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Introduction 

The Gcn5-related N-acetyltransferases (GNATs) are a very 
large enzyme superfamily with more than 10,000 members 
identified across all kingdoms of life [1]. They were first identified 
as aminoglycoside acetyltransferases from bacteria that developed 
antibiotic resistance to kanamycin and gentamicin [1]. The 
GNATs catalyse the transfer of an acetyl group from acetyl CoA 
to the primary amine of substrates including antibiotics amino- 
glycosides, glucosamine-6-phosphate, histones, serotonin, sperm- 
ine, spermidine, and other small molecules [2-6] . In spite of the 
substrate and functional diversity, the basic structure of GNAT 
members is highly conserved. Members of the family share a 
common fold known as the GNAT fold, comprised of 6-7 anti- 
parallel P-strands and 4 a-helices in the topology pi-otl-ot2-P2-P3- 
P4-a3-P5-a4-P6-P7. 

The GNAT fold contains four conserved motifs A-D, arranged 
in the order C, D, A and B in the primary sequence. The most 
highly conserved motif across the superfamily is motif A, followed 
by motif B, D, and C. The hallmark of motif A is the 'T-loop", 
connecting helix 0l3 and strand P4, which plays an essential role in 
binding the P-mercaptoethylamine and pantothenic acid moieties 
of acetyl-CoA. Motif B spans conserved regions within Ol-helix 4 
and is involved in binding the 3', 5 '-adenosine diphosphate of 



acetyl-CoA. Motif D, encompassing P2 and P3 strands is not 
directly involved in substrate or cofactor binding, but stabilises 
core structural elements of the protein, while motif C, located at 
the N-terminus of the protein, is the least conserved with some 
histone N-acetyltransferases lacking this motif completely. These 
motifs together comprise the common structural core known as 
GNAT domain. A signature of the GNAT fold is a splay between 
P4 and P5 strands, forming a V-shape opening in the central P 
sheet which is crucial in the transfer of acetyl group and binding of 
acetyl-CoA [7]. 

Whilst cell regulation through acetylation has been well 
characterised in eukaryotes, the role of protein acetylation within 
prokaryotes has only emerged recendy, providing support that 
acetylation based regulation is an important and universal process. 
Staphylococcus aureus, an important pathogenic and increasingly 
multi-drug resistant bacterium, contains 35 putative GNAT 
enzymes, many of which remain uncharacterised both functionally 
and structurally. It is also an opportunistic human pathogen and 
frequent cause of infection ranging from mild to life threatening 
illnesses including bacteremia, meningitis, osteomyelitis, liver 
cirrhosis, keratitis, pneumonia, septic phlebitis and endocarditis 
[8,9] . Moreover, rates of S. aureus infections have increased over 
past decade as has antibiotic resistance to commonly used 
antibiotics including rifampicin, vancomycin and methicUlin 
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Figure 1. Protein purification profile of 5aGNAT. (A) FPLC profile of the affinity purification, with SDS-PAGE insert showing lane 1 - whole 
bacterial cell lysate; lane 2 - soluble protein fraction of the bacterial cell lysate; lane 3 - flow-through from the affinity column; lane 4 - affinity elution. 
(B) Size exclusion purification indicating the theoretical elution volumes for monomer, dimer, and trimer, and an SDS PAGE insert lane 5 - showing 
the final purity of the protein. 
doi:1 0.1 371 /journal.pone.01 02348.g001 



[10,11]. Resistance towards the aminoglycoside antibiotics can 
occur through a range of mechanisms including aminoglycoside 
modifying enzymes, ribosomal mutations, or excretion of the 

Table 1. Data collection and refinement statistics. 



aminoglycoside. Aminoglycoside-modifying enzymes can inacti- 
vate antibiotics by covalently attaching either a phosphate, 
nucleotide, or acetyl moiety to either the amine or the alcohol 





Wavelength (A) 


0.9537 


Resolution range (A) 


33.15 - 2.15 (2.23 - 2.15) 


Space group 


C2 


Unit cell (A) 


97.49, 78.86, 66.01, 90, 1 1 1.95, 90 


Unique reflections 


25154 (2479) 


Multiplicity 


13.4 (13.0) 


Completeness (%) 


99.38 (98.72) 


Mean l/sigma(l) 


10.8 (6.0) 


Wilson B-factor 


17.67 


R-merge 


0.07 (0.12) 


R-work 


0.1854 (0.1925) 


R-free 


0.2226 (0.2648) 


RMSD (bonds) (A) 


0.007 


RMSD (anglesX ) 


1.02 


Ramachandran favored (%) 


99 


Ramachandran outliers (%) 


0 


Clashscore 


2.65 


Average B-factor (A^) 


21.60 


Macromolecules (A^) 


20.80 


Ligands (A^) 


22.80 


Solvent (A^) 


30.40 



Statistics for the highest-resolution shell are shown in parentheses. 
doi:l 0.1 371/journal.pone.Ol 02348.t001 



PLOS ONE I www.plosone.org 



2 



August 2014 | Volume 9 | Issue 8 | el 02348 



Structural Characterization of a GNAT from Staphylococcus aureus 




Figure 2. Tertiary structure of SaGNAT in cartoon format, with a-helices, p-strands, and loops colored in red, yellow, and green 
respectively. 

doi:10.1371/journal.pone.0102348.g002 



key functional group (or both groups) of the antibiotic, changing 
the charge or sterically liindering the antibiotic [12,13]. Thus, 
characterisation of proteins capable of playing a role in antibiotic 
resistance and regulatory functions within important pathogenic 
bacteria provides an important platform for rational drug design, 
development of new inhibitors, and an enhanced understanding of 
the putative functional roles. 

Here, we describe the structure of an uncharacterised, GNAT 
family member from S. aureus. Our structure confirms that the 
protein exhibits many of the classical GNAT motifs, has high 
structural similarity with the phosphinoacetyl GNAT proteins, and 
is likely to exist as a dimer in solution based on biophysical and 
crystallogxaphic properties. 

Materials and Methods 

Cloning and expression 

The gene encoding the putative N-acetyltransferase from S. 
aureus subsp. aureus Mu50 NP_373053 (SaGNAT) was PGR 
amplified from genomic DNA purchased from American Type 
Cell Culture (ATCC), and cloned into the expression vector 
pMCSG21. The fidelity of the clone was confirmed by DNA 
sequencing and the plasmid transformed into E. coli BL21 (DE3) 
pLysS for recombinant expression. A 5 ml Luria-Bertani (LB) 
broth starter culture containing 100 |J.g/ml spectinomycin was 
used to inoculate 500 ml of auto-induction media [14] containing 
100 |Xg/ml spectinomycin grown at 25°C for 24 h. The cells were 
harvested by centrifugation and the cell pellet resuspended in 
50 ml of His buffer A (50 mM Phosphate buffer pH 8.0, 300 mM 
NaCl, 20 mM Imidazole) and stored at -20°C. 

Protein purification and crystallisation 

The E. coli cells were lysed by 2 repetitive freeze-thaw cycles in 
the presence of 20 mg of lysozyme, and the lysate centrifuged at 
15,000 rpm for 30 min. The supernatant was filtered through a 
0.45 |J,m filter and the supernatant loaded onto a 5 ml Ni^"'" 
column (HisTrap HP, GE Healthcare) in His buffer A. Following 



extensive washing of the column (> 1 0 column volumes) in His 
buffer A, the protein was eluted using an increasing gradient of His 
buffer B (50 mM phosphate buffer pH 8.0, 300 mM NaCl, 
500 mM imidazole). Elution fractions were pooled and treated 
with tobacco etch virus protease (100 |J.L of 3.3 mg/mL) overnight 
at 4°C to remove the afifmity tag. The cleaved protein was further 
purified by size exclusion chromatography (Superdex 200 column, 
GE healthcare) in GST buffer containing 50 mM Tris, pH 8.0, 
and 125 mM NaCl). The fractions containing protein were pooled 
and concentrated to 27 mg/ml using an Amicon ultrafiltration 
device (MiUipore). The purity of the protein was assessed by SDS- 
PAGE and stored at — 80°C. Crystallisation screening was 
undertaking using the hanging-drop vapour-diffusion method 
and commercially available screens (Hampton Crystal Screen, 
PEG/Ion, Crystal Screen 2 and PEG/Ion 2). The drops contained 
1.5 (il of the protein, to which an equal volume of reservoir 
solution was mixed, and suspended over 300 |Xl of reservoir 
solution at 296 K. Plate shaped diffraction quality crystals were 
obtained in 1 M sodium acetate trihydrate, 100 mM HEPES 
pH 7.5, and 50 mM cadmium sulphate hydrate. 

Data collection, structure determination and refinement 

Crystals were flash-cooled at 100 K in liquid nitrogen with 
reservoir solution containing 30% glycerol as a cryoprotectant. 
Diffraction data were collected from a single crystal at the MX2 
crystallography beamline at the Australian Synchrotron. Data 
were indexed and integrated using iMOSFLM [15] and scaled in 
AIMLESS [16]. Molecular replacement was undertaken using 
Phaser [17] and chain A of PDB 2JLM (48% sequence identity) as 
a search model. Model building and refinement was performed in 
Coot [18] and Phenix respectively [19]. 

Results and Discussion 

Protein production and structure determination 

To determine the x-ray crystaUographic structure of SflGNAT, 
the gene encoding the protein was cloned into bacterial expression 
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Figure 3. Structure based alignment of the 5aGNAT (green) with 3 acyi-transferases and RMSD less than 1 A. 1YR0 (orange), 1BL1 
(blue), and 2JLM (magenta) are crystal structure of phosphinothricin acetyltransferase from Agrobacterium tumefaciens, 
Pseudomonas aeruginosa, and Acinetobacter bayiyi respectively. Blue and red boxes depict conserved CoA binding and active site residues 
respectively, yellow box indicate residues involved in dimer formation, and unfilled boxes represent strictly conserved residues. 
doi:10.1371/journal.pone.0102348.g003 



vector pMCSG21 [20] and recombinantly expressed as a 6-His 
tagged fusion protein in E. coli BL21 (DE3) pLysS. The protein 
was solubly over-expressed using the auto-induction method [14] 
(see Fig. I lanes 1 and 2), and a two-step purification incorporating 
affinity and size exclusion chromatography resulted in greater than 
95% purity (Fig. 1). SaGNAT protein crystals produced in 1 M 
sodium acetate trihydrate, 100 mM HEPES pH 7.5, and 50 mM 
cadmium sulphate diffracted to 2.15 A and were indexed and 
integrated in the space group C2, with unit cell parameters 
a = 97.5 A, b = 78.9A, c = 66.0A, a = 90°, |3= 112.0°, y = 90°. 
Molecular replacement using Phaser [17] and chain A of PDB 
model 2JLM was used to place 2 molecules in the asymmetric unit, 
corresponding to a Matthews coefficient of Vm 3.18 A'^ Da ' and 
61.4% solvent content [21]. Extensive model building and 
refmement using COOT [18] and Phenix [19] respectively 
produced a final model with an Rcrj-st and Rfjep 0.18 and 0.22 
respectively. All amino acid residues were modelled with the 
exception of the final C-terminal residue. Coordinate and 
structure factors have been validated and deposited to Protein 
Data Bank and assigned the PDB ID code 4MBU. Data-coUection 
and refinement statistics are summarized in Table 1. 

Structure of SaGNATIS 

The refined x-ray crystaUographic structure revealed SflGNAT 
to be an a/fi protein comprised of 4 ot-hehces and 7 (3-strands, 
with a topology [5l-otl-a2-(52-[53-(54-a3-[55-a4-[56-(37 (Fig. 2). All 
P-strands are arranged sequentially according to sequence, with 
the exception of fi7, located between strands fi5-6. Two central 



antiparaUel fi-sheets (fil-4 and (55-8) are splayed between (54 and 
fi5 to create a V-shape in the protein (Fig. 2). The two fi-sheets are 
held together at the V joint by hydrogen bonding located on the 
N-terminal residues in strands ^4-^5, and diverge at Ser*^'* and 
Ala . This signature feature of GNATs is stabilised by hydrogen 
bond interactions between water molecules and the amide N and 
carbonyl O atoms from the protein main chain. The N-terminal 
arm of the protein is comprised of an antiparaUel fi-sheet (|31-4) 
flanked by 3 a-helices (o(l, a2 on one side, a3 on the other), and 
the C-terminal arm is comprised of an antiparaUel sheet (P5-7) 
flanked by a4 on the same side as a3. 

To assess both the sequence and structural simUarities of 
SfflGNAT with other GNAT-proteins, BLAST and DALI searches 
were undertaken. A sequence homology search of the non- 
redundant database using BLASTP (2.2.28) revealed the most 
closely related enzyme to be a phosphinothricin N-acetyltransfer- 
ase (YP_008780792) from Bacillus cereus, sharing 60% sequence 
identity. This low sequence identity between the two closest 
related homologues is not unusual in the GNAT famUy, with 
subfamUies weU documented to have highly variable amino-acid 
sequences, yet retaining very high structural homology [1]. In 
support of this, a structural homology search using DALI revealed 
3 proteins with an rmsd of less than 1 A, aU corresponding to 
phosphinothricin acetyltransferases (PDB lyrO/rmsd 0.8/40% 
sequence identity; PDB 2jlm/rmsd 1.0/48% sequence identity; 
PDB 2bll/rmsd 0.9/47% sequence identity). The structural 
overlay and alignment of these proteins is presented in Fig. 3, 
with the conserved active site and CoA binding site residues 
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Figure 4. Quaternary structure of SaGNAT showing interacting residues at tKie dimer interface. 

doi:1 0.1 371 /journal.pone.01 02348.g004 



highlighted based on liomology with other GNAT family 
members. 

Quaternary structure of SaGNAT 

SaGNAT is likely to exist as a dimer based on the crystal 
structure, structural similarity with homologous proteins, and 
elution profiles fi-om size exclusion chromatography. In the 
asymmetric unit of the crystal, two SaGNAT molecules were 
present with a buried surface area of 1,397 A^, strongly suggesting 
that this interaction is biologically relevant. Analysis of the 
inteferaces within the crystal using PISA (Proteins, Interfaces, 
Structures and Assemblies) also predicted this dimer configuration 
is likely to represent the biological unit, with other possible 
crystallographic contacts displaying less than 200 A^ of surface 
area. Consistent with this result, the structural homology search 
above confirmed that the proteins with an rmsd of less than 1 A 
also exist in the same dimeric configuration. Finally, the elution 
profile during size exclusion chromatography supports that the 
protein exists as a dimer in solution (Fig. IB). The full dimer 
conformation is presented in Fig. 4, and detailed interactions that 



mediate the dimer binding are also described. Briefly, the binding 
interface in comprised Ala75:Tyr28/Tyrl46; Tyr30:Gln77; 
Arg71:Glu81; Thrl40:Alal38; Thrl 14:Thrl42/Asnl43; 
Thr79:Vall44; Thrl42:Glul58; Aspl60:Asnl43. 

Conclusion 

Here, we describe the 2.15 A structure of a GNAT family 
member within S. aureus. The structure confirms that the protein 
exhibits the core GNAT fold, and has high structural homology 
with phosphinothricin acetyltransferases. Consistent with this, the 
closest homologue identified by BLAST sequence analysis, was 
also a phosphinothricin acetyltransferase. Putative residues 
involved in acetyl-CoA and have been identified based on 
structural homology within the GNAT family. 
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